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I. Requirements of the Clickstream System 

The Clickstream System is a software system residing on several hardware platforms within the BellSouth 
Interactive Video Services Network (IVSN). The system is being designed for the BellSouth Residential Broadband 
Trial which will take place in the city of ChamHee, Georgia from May, 1995 through May, 1997. The purpose of 
the system is three fold 

1) To provide Che capability for the collection of knowledge about the subscriber usage of the BellSouth System 
and Services. This is called Clickstream information. This knowledge will allow analysis of data to generate 
detailed viewing habits, programming ratings, and detailed demographics of the subscriber population. 

2) To provide a method (or methods ) to merge content identifier Metadata with the Clickstream information. 

3) To provide a storage and organization database system to manage and analyze the Clickstream and content data, 
and to provide an upload facility to 3rd party analysis entities, such as NCM I 3 /Optimark. This system has been 
commonly referred to as the MarKeting and Information System (MKIS). 

These 3 main functions of the Clickstream system are tailored to support the analysis of subscriber Interactive Video 
Services Network (IVSN) usage. Some analysis capability will be supported by the MKIS system, but main 
analysis function will be provided off-line by a third party (NCM/BULL/Arbitron) system referred to as I 3 / 
Optimark. For a more detailed account of data analysis function, refer to NCM/BULL/Arbitron documentation. 
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2. Clickstream Hardware/Software Entities 

The software subsystems to support the Clickstream collection. Metadata merging, and data storage run on several 
hardware platforms as outlined in Figure 1. The subsystems communicate with each other in various ways to 
transmit Gickstream information, data error detection schemes, and data acknowledgments. Only the flow of 
Clickstream pertinent data and Metadata are displayed in figure 1. The software subsystems are divided functionally 
within each platform to aid in more parallel design, development and testing efforts. 




Figure 1: Clickstream System Data Flow 



Note: Arrows represent flow of Clickstream data only 



Private/Proprietary to BellSouth Interactive Media 



5 : 




BellSouth Interact! re Media Services 

Clickstream System Snec DRAFT vl.30 8/18/95 

2.L Software Subsystems 

The following is an overview of expected software subsystems, in-depth functionality will be covered in following 
sections. 



2.1,1. STB Platform 

There are several parts to the STB based Clickstream Processor code: 

1) Clickstream Kernel 

2) Double Buffer of Clickstream Events 

3) Clickstream Upload Handler 

4) Clickstream Message Receiver, Upload Controller 

5) Clickstream Event API 



The STB based Clickstream Processor will reside in, and be executed from DRAM. It will be downloaded to the 
STB during the Level 2 boot process, and will be designated as a * 'Resident'' application by the PowerTV Operating 
System (It will remain memory resident over soft powerdown STB states). 
The Clickstream Kernel will buffer Clickstream Events handed to it by applications. 

The Clickstream Message Receiver will accept control messages over the ESF Pass-through data link to control the 
uploading of information over the reverse path network and will store the payload of these messages in appropriate 
and available memory (NVM when possible). It will also accept the messages sent to it to acknowledge tberecopt 
of a number of Clickstream Data Packets. 

The uploading of data will occur on a scheduled basis over every 24 hour time period. Clicks treams will be uploaded 
over the system messaging or "LI Pass-Thru*' protocol. This scheduling function will allow the distribution of 
uploads evenly over 24 hour periods to minimize network impact Uploads may also be suspended every day during 
high network usage times (prime time, etc.). 
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Figure 2: STB Clickstream Journalllng Flow 



2.1.2. HP VTE Platform 

The video server simply acts as a pass-thru device for both STB sourced Gickstream Uploads, and for data ACKs 
returned to the STB. There is a communications router inside the VTE which will re -direct IP traffic to the 
appropriate destination. 



2. 1.3. Sybase IMS Platform 

The IMS platform will have two functions pertaining to the Gickstream system. One will be to act as a pass-thru 
device for STB sourced Clickstream information going to the Staging Server* and as a pass-thru device for Staging 
Server data acknowledgments to the STB. This communication will be handled through the Sybase Open 
Qient/OpenServcr Software and will take the form of Remote Procedure Calls (RPCs) executed on the IMS. The 
other function will be to replicate IMS log information to the Staging Server for inclusion in the Metadata winch is 
further replicated to the MKIS. 
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2.1.4. Sybase Staging Server Platform 
The Sybase Staging Server will have four main functions. 

1) It will act as an endpoint in the Clicks tream upload process. This is handled by a process and flat file 
persistant storage called "Event Capture". It will use the RPCs made from the IMS server to pass Clicks tream 
information over, and formulate and send data acknowledgments to the STB. 

2) It will handle replicated IMS data logs destined for the MKIS system. 

3) It will have a decompression, parsing, and data merging mechanism which will separate the Clicks tream 
data into appropriate relational database entities as oudined in the MKIS specification. 

4) It will act as a persistent storage device from which MKIS database information can be replicated. 

2.1.5. MKIS Platform 

The MKIS system will serve several functions within the Clicks tream system. The particular design of this database 
and analysis engine is covered extensively in the document "IT Architecture Design Ver. 1.1". A top level 
look at this system reveals 4 main functions. 

1) Mass data repository and relational database structure for 

Clicks tream, subscriber, demographics, billing , advertisement and content information 

2) To enable regular data uploads to third party analysis engines, and to be able to restrict access to sensitive 
data fields. ; S nne 

3) To perform "native" data analysis functions and report generations to different specifications. 



Applicati on 
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3. STB Clickstream Processor 

3.L Overview 

The Clickstream processor is structured to store Clickstream event records in one of two Clickstream double buffers. 
The Clickstream buffer sizes are staticly provisioned by the PowerTV OS as defined in the code download process. 
The buffer size will be optimized during the test and integration phase of the project The possibility has not been 
ruled out to addressaMy download the size of the Clickstream STB double buffer. The size of any particular 
Clickstream event record is variable to handle the difTerent journaling needs of different applications. An event record 
is the individual logging of a subscriber input to the STB. The generation of these events fill one of the two 
Clickstream buffers. When a buffer is filled, or an upload timer event has expired, the Clickstream processor is 
flagged to initiate an upload process with the Staging Server based Event Capture application, and the buffer 
becomes locked. Subsequent Clickstream records are stored to the second Clickstream buffer. 

The upload process consists of several steps. The buffer to be uploaded through the network is locked. No more 
Event Records can be stored to this buffer. The buffer is then compressed. The PowerTV OS will support the LZW 
compression utilities. These will be employed to compress the buffer. Fifty percent compression is acheivable with 
this algorithm on random binary data. Clickstream data roughly fits into this category. The resultant compressed 
data is then divided into transmission "transactions" or "packets" and data is placed into the packet headers to indicate 
packet ID, IP destination address, etc CRC error detection will either be placed on each packet by the Clickstream 
Processor (application level), or by the UDP driven in the OS. This issue is currently TBD. Each data packe^haa 
UDPflP protocol around a LI pass-thru header. The final bit-format level specifications have not yet been received 
from PowerTV , but will be included in this specification on their receipt 

The Clickstream data packets are uploaded over the SIotted-ALOHA data transmitter out of the STB. The Staging 
Server based "Event Capture" application will receive these records and transmit addressable responses to the STB in 
the form of data acknowledgments. The frequency and periodicity of data acknowledgements will be determined 
during the implementation stage of the project. Correct implementation of application level data acknowiegements 
require empirical knowlege of network bit error rates, netowork packet error rates, and causes of these errors in 
transmission. 

If the second buffer becomes completely full and the upload process has not completed, then a buffer ovemm 
condition will occur. The last record in each Clickstream buffer is reserved for a buffer overrun record with a 
times tamp, this essentially tells the Staging Server resident application that STB clicks were missed starting at a 
particular time. This error condition can then be corrected by increasing the buffer sizes. 

(NOTE: Receipt of a buffer ovemm record will tell us that buffer sizes have not been set appropriately, and we will 
have to re-set buffer sizes, re-compile our code and release it to the system as an update. We may also implement an 
addressable parameter setting the Clickstream buffer sizes dynamidy. (There are reasons both pro and con to do 
this). Appropriate buffer sizes should be determinable within the first month of integration testing.) 



3.2. Clickstream Kernel 
The Clickstream code will be downloaded to the STB during the Level 2 (L2) boot sequence as outlined in the 
BellSouth/Sybase document 4 TTV Process Flows Rev. 1.00". It will be bundled along with all Binary Large 
OBjects (BLOBs) intended to be memory resident , and downloaded at initial AC power on following the Level 1 
(LI) bootterm . Bundling of services takes place within the IMS server and takes the form of a "list of services" 
which is the first item downloaded to the STB on L2 Boot There is a memory allocation and a prioritization table 
associated with each of these applications downloaded into DRAM. They are contained in what PowerTV terms a 
•THspatch Table" included with each downloaded application. For more information on lists of services and the code 
download process, see Sybase IMS documentation, and the PowerTV document "Loader Specification", and 
"Application Download Procedure". 
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3.3. Clickxtream Buffering 

The database is allocated into a double buffer (two blocks of memory). Each block is a contiguous free area of 
DRAM which is set aside for this application use only. The buffer sizes should not need to be very large (10K- 
15K7), so allocation of contiguous blocks should not be a problem. Utilization of more advanced database 
techniques such as linked lists or record pointers should not be necessary in this type of application and would only 
increase the application footprint and complexity. The database buffers should be allocated to allow at least 4 to 8 
hours of peak channel "surfing" between uploads. This will lead to provisioning the actual size of the data buffers 
after we get some empirical results on subscriber usage. This should keep system bandwidth needs within reason. 
Data compression techniques will be employed to further reduce the transmission bandwidth needs. 

The Clicks tream event records consist of data specified by Aibitron/NCM in the document "Database Design 
Specification NCM I 3 **, plus additional fields used on an application specific basis to allow for flexible 
documentation of subscriber usage of the interactive system. This data is collected from two sources by the STB 
when an event is registered. The application generating the Clicks tream event hands ofT data regarding the 
application state and the subscriber UI which caused the event. The application will also hand over information 
obtained from the OS such as timestamp and STB id. 

3.4. Clicks tream Event Format 



3.4.1, General Header Format for Events 

The general event format is tailored to be as flexible as possible. Each application running on the STB will interface 
to the Clicks tream routines to send just the data type and format that it needs to journal. 

The general information sent by each application will uniquely identify the time of event generation to the second, 
the application which generated the event and the number of bytes to follow which constitute the information the 
application wishes to journal. 

(Developer Note: The Application ID sent in the API call to the Qickstream Processor and stored in the Buffer 
as shown below is NOT the Application ID given to your application by the Operating System during 
initialization, but the Application ID listed in section 3.4.3 of this document This identifier is unique and static 
and will enable backend systems to parse the events jouroaled by your application. The application ID given to you 
by the operating system at application initialization is dynamic, and will change every time your application is 
launched. If there is no Application ID listed for your application in Table 4 of section 3.43, please contact current 
contact person listed on the cover of this document to have one assigned to you.) 
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Syntax 


Size/Format 


Clickstream Buffer () 




{ 




Buffer Header Record { 




Transaction Code 


ui8 


Qickstream Version Number 


ui8 


Timcstamp 


ui8*6 


Number of Bytes 


ui8 


STB MAC Address Most Significant 


uil6 


STB MAC Address Least Significant 


ui32 


Compression Type 


ui8 


> 




Event Record 




for (i=0; i<N; i++) f 




Timestamp 


ui8*6 


BIMs Assigned Application ID 


uil6 


Number Bytes to Follow (length) 


ui8 


♦•Application Specific Data** see section 3.4.4 for specific 
*~ formats used by each application- 


ui8 * length 


> 




Clickstream Trailer Record { 




Timestamp 


ui8*6 


BIMs Assigned Application ID 


uil6 


Number Bytes to Follow (length) 


ui8 


Upload Status Code 


ut8 


> 




\ 





N = Number of Event Records able to be stored in this ClickstreamJJuffer. 

Figure 3: Clickstream Buffer Format 



There are two buffers such that one may be "frozen" during the upload process t and the second buffer can then be 
employed to continue jouraaling event records. Both buffers are of identical format. This format is shown in Figure 
2 above. The notation used in Figure 2 is ISO standard for description of data structures. 

The Clickstream Buffers are formatted specifically to ease in their transmission back through the distribution 
network to the Staging Server "Event Capture" data repository. The first section of entries comprise a header record 
which will indicate the time the buffer was first opened, the number of bytes in the buffer, will define the originating 
STB by MAC Address, defines the version of the Clickstream Kernel which generated the buffer, and specifies the 
type of data compression used on the following data (if any). This first record is of fixed length and is never 
compressed. All bytes following "Compression Type" are possibly compressed to save in transmission bandwidth. 

The Clickstream .Trailer record appears at the end of a Clickstream Buffer and usually denotes an error condition. If 
the upload capabilities of the system did not allow the clearing of data from the STB Clickstream Buffers. This 
record distinguishes itself by the Application ID pointing to a "Missed Records" type so that queries can be 
performed on this data in one of our backend systems. The Upload Code is then used to identify the possible cause 
for the error condition. These error codes are outlined in section 3.4.3. 
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3.4.2, Upload Codes 

Upload Codes in the general event format are meant as an indicator of what state the Clicks tream application was in 
when a buffer overflow occurred. This code indicates what stage in the upload process the Clicks trcam kernel was in 
at the time of overflow . This will help us determine the cause of buffer overflows, and the appropriate way to solve 
these problems during testing and integration. 



Upload Codes 


0x00 


Not Used 


0x01 


Upload in Progress 


0x02 


Upload Completed, No Acknowledgement Received 


0x03 


Upload Completed, Partial Acknowledsements Received 


0x04 


No Upload Attempted 



Figure 4: Upload State Codes 



3.4.3. Application Unique Identifiers 

Each application with the ability to operate on a BellSouth Interactive STB will be assigned a 16 bit unique 
identifier. In addition, each application will have the option of posting an 8 bit Application State identifier within 
it's application specific data. This document, and further revisions, will act as the authority for the assignment of 

taHe specifies these "AFP IDs". 



Application Identifiers 


0x0000 


Operating System 


OxOOOl-F 


Operating System Sub-Systems (TBD) 


0x0010 


Application Manager 


0x0011 


Cable Television Application 


0x0012 


Qickstream Set-Top Box Kernel 






0x0100 


Prevue Interactive Services - EPG System 


0x0 101 


Digital Pictures - Interactive Game 


0x01 10-F 


Viacom - MTV / Showtime, etc. 






0x1000 


Interplay Written Applications General ID (TBD) 


0x1001 


Interplay Runtime Engine 


0x1002 


Interplay Navigator 


0x1003 


Interplay VOD 


0x1004 


Interplay NVOD 


0x1005 


Interplay TownGuide 






0x1100 


The Weather Channel, Weather On-Demand 


0x1101 


Woddspan - Travel On-Demand 


0x1102 


Lightspan - Educational Interactive Application 


0x1103 








OxFFFF 


Missed Events Record 



Figure 5: Application Unique IDs 
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3.4.4. Application Specific Event Formats 

Developer Note: The CHckstream formats proposed in this section are close to what we currently believe will 
be the final data formats. As we get a better understanding of all of the analysis needs, these data formats may 
change. 

3.4.4.1* Operating System 

The operating system will have the ability to make the same call to the Qickstream system as any other code 
residing on the STB. It may use this facility to log errors, for testing purposes, to track code usage, or for any other 
reasons. The OS tie-in to the Qickstream system will be predicated by PowerTV's interest in being involved, and is 
not seen as likely in the short term. 

J. 4. 4. 2. Application Manager 
Application Manager is a portion of the "Level 1 Client" as defined in PowerTV operating system documentation. 
This code will be responsible for managing the execution of all applications on the STB, handling application 
switching, removing applications which are mis-behaving, and the like. BellSouth will be interested in logging of 
all application switching activities. A data format has been established to this end. Implementation of Application 
Manager po^nng Qickstream Events will depend on schedules and interesUevel of PowerTV. •_ 



Syntax 


Size/Format 


ion Manager: Application Specific Data 0 








Event Record 




for (i=0; i<N; i++) f 




Event ID: See Global Event ID table for Syntax 


in 16 


Suspend Application ID 


uil6 




uil6 


Application Error Code: See below for Syntax 


ui8 






\ 






Error Code Syntax 


Data 


Application Mutagen Application Error Code () 


ul8 


f 




Do not use (No Error) 


0x00 


Application termination reason unknown 


0x01 


Missed Watchdog Tuner (Application Keep- Alive) 


0x02 


Unexpected Session Teardo wn 


0x03 


Memory Error 


0x04 


Application will not download 


0x05 


Communication Error 


0x06 




0x07 


\ 





Figure 6: Application Manager Event Data Format 
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J. 4. 4. 3. Cable TV Application 

The Cable Application is the second half of what PowerTV calls the "Level 1 diem". It is responsible for tuning 
both analog and digital broadcast services. 



Syntax 


Size/Format 


Cable Application : Application Specific Data () 




f 




Ereot Record 




for (i=0; i<N; i++) { 




Event ID: See Global Event ID table for Syntax 


uil6 


Channel ID : See Broadcast Channel ID table for Syntax 


ml 6 











Figure 7: Cable Application Event Data Format 



The Channel ID 16 bit field is a unique identifier for broadcast networks. In analysis of data, the fact that channel 6 

was watched more than channel 7 has little or no meaning to BellSouth. When the network associate d wit h the 

channel is used, the data becomes both clearer and more valuable. To illustrate, the fact that 4 TBS" was watched 
more frequently than "MTV" would be valuable information, and is more in line w*th our end requirements. This 
encoding method also allows the data to stay consistent across channel lineup changes that will take place during the 
trial, or as the system grows past technical trial this encoding scheme will allow Clickstream data from various 
headends in locations all over the southeast to generate consistent data no matter what their individual channel 
lineups may be. A list of all major broadcast networks in the US appears in the BellSouth John Stefamk authored 
document "Residential Broadband Applications Guide V 1 . 10". The encoding scheme for these broadcast providers is 
defined in this document in the Metadata formats section. The Event ID as listed above defines the action which 
occurred on that channel at that time and is also defined in the Metadata section of this document 



3. 4. 4. 4. Clickstream STB Kernel 

The Clickstream Kernel may generate Clickstream events itself to log errors in Clickstream collection and/or 
transmission. We will have a better handle on the types of errors or faults we would like to capture during our 
and integration phase. This information will be folded into this design document on an as-needed basis. 
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3.4.4.5. Navigator 

The Navigator is the interactive menuing system which enables viewers to select from our many interactive services. 
This is a BellSouth initiative and we will be collecting usage data to analyze how easily viewers are able to access 
programming they are after. 



Syntax 


Size/Format 


Navigator : Application Specific Data () 




{ 




Event Record 




for (i=0; i<N; i++) f 




Event ID: See Global Event ID table for Syntax 


uil6 


Application State ID: See table below for Syntax 


ui8 


for(i=0;i<Nl;i++) f 




Character S trine: ASCII, Variable Length 


b8'Nl 


> 




> 




> 






State J D 


Data 


Navigator: Applicatlon State.ID () 


ul8 


f 




Fly-Thru 


0x00 


Main Menu 


0x01 


Information (Help) Screen or Video 


0x02 


Movies Sub-Menu 


0x03 


Movie Categories Sub-Menu 


0x04 


List of Movies Sub-Menu 


0x05 


Movie Info Screen 


0x06 




0x07 


) 





Figure 8: Navigator Buffer Format 
3.4.4.6.VOD 

Video On Demand will be launched from the Navigator and will be a BellSouth service. Clicks tream capturing will 
be interested in the amount of pausing. Fast-forwarding and Rewinding people do, as well as what is being viewed at 
a particular point in time. Some of this will change as we get a better idea of exactly what events are logged by the 
IMS server. We will try not to duplicate effort wherever possible. If it is determined that the IMS Event Log is 
capturing this level of information, then the Clickstream Joumalling of this information is redundant and will not be 
implemented. 
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Syntax 


Size/Format 


Video On Demand (VOD) : Application Specific Data () 




f 




Event Record 




for (i=0; i<N; i++) ( 




Event ID: See Global Event ID table for Syntax 


uil6 


Application State ID: See table below for Syntax 


ui8 


> 




> 






State ID 


Data 


VOD: Appllcation_State_ID () 


ul8 


{ 




Playing 


0x00 


Paused 


0x01 


Fast Forward 


0x02 


Rewind 


0x03 


Info (Help) Video or Screen Played 


0x04 


reserved 


0x05 


reserved 


0x06 


reserved 


0x07 


) 





Figure 9: VOD Buffer Format 



3.4.4.7. NVOD 

This application is also a BellSouth service. The following tables outline expected application specific event 
formats, and application states possible within the application. The NVOD application will be active without an 
active Client/Server connection to the IMS. Clickstream System will be the oly mechanism for BellSouth to glean 
usage information on this application. 



Syntax 


Size/Format 


Near Video On Demand (NVOD) : App Specific Data () 




i 




Event Record 




for (i=0; i<N; i++) f 




Event ID: See Global Event ID table for Syntax 


in 16 


Application State ID: See table below for Syntax 


ui8 


} 




} 
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State ID 


Data 


NVOD: Application.StateJD () 


ui8 


< 




Playing 


0x00 


Incremental Pause 


0x01 


Incremental Skip Forward 


0x02 


Incremental Skip Backward (Rewind) 


0x03 


Info (Help) Video or Screen Played 


0x04 


reserved 


0x05 


reserved 


0x06 


reserved 


0x07 


l 





Figure 10: NVOD Buffer Format 



3.4.4.8.EPG 

Prevue Guide Interactive Services are expected as the Electronic Program Guide (EPG) provider for the BellSouttf 
Interactive Trial. 



Syntax 


Size/Format 


Electronic Program Guide (EPG) : App Specific DaU () 




f 




Event Record 




for (i=0; i<N; i++) f 




Event ID: See Global Event ID table for Syntax 


in 16 


Application State ID: Sec table below for Syntax 


ui8 


reserved 


iri8 


> 




> 






State ID 


Data 


EPG: Application State ID () 


ul8 


t 




Initial Display Screen 


0x00 


Look Ahead Display 4 hour 


0x01 


Look Ahead Display 8 hour 


0x02 


Look Ahead Display 12 hour 


0x03 


Look Ahead Display 16hour 


0x04 


Look Ahead Display 20 hour 


0x05 


Look Ahead Display 24 hour 


0x06 


reserved 


0x07 


b 





Figure 11: EPG Buffer Format 
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3.4.4.9. Home Shopping 

Expected data formats will be developed following the successful deployment of initial offerings of interactive 
applications in 1995. 

3.4.4. 10. Other 

Expected data formats for a large number of interactive applications will be developed following the successful 
deployment of initial offerings of interactive applications in 1995. 
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3.5. Applications Programming Interface (API) to Clickstream 



To interface with various third party applications, the Clickstream Processor has utilized the cooperative nature of 
the PowerTV operating system which allows for internal point to point messaging. The PowerTV operating system 
uses the concept of EVENT very broadly to handle everything from thread usage, semaphores, system thread 
contention, timer handlers, exception handling, etc. Using the EVENT system* BellSouth has defined a unique 
Operating System Event Type: 

( kEt.clickStream ), which will be used for inter-process communication with the Clickstream Kernel. This allows 
us to get away from the alternate design of having to support Dynamic Link Libraries (DLLs) for all interested 
applications, and the support and complication it would have introduced into the system. 



Developer Note: Make sure that the difference between the "Operating System concept of Event" and the 
generation of a "Clickstream event" remain somewhat clear to you. they can be easy to confuse one for tlie other. 

The Clickstream event, or Clickstream Journalled event is a data set that identifies subscriber action on the ITV 
system at a particular point in time. e.g. "Channel was incremented once from channel 5 at 5:12PM on Tuesday." 

The PowerTV Operating System Event system and PowerTV Event data structure are generic ways the OS gets its 

job done. The system and the data structure are emp loyed to handle a myriad of OS responsibilities. We are simply 
using this Event system as a way to get our Clickstreams from point A to point B within the Set-Top Box. 



3.5.1. PowerTV OS Events: 

Running under the PowerTV OS, an application can generate Events, receive Events, filter reception of Events, etc. 
The Event itself consists of 5 parameters: 



PowerTV General Event Data Structure 


Size/Format 


Event () 




i 




Code 


Defines the Event Type, Event source, 
what type of device generated the Event, 
and a general 8bit field for Event data. 


ui32 


X 


Application specific data field. 


ui32 


Y 


Application specific data field 


ui32 


Z 


Application specific data field. 


ui32 


T 


System time when Event was generated, (this is 
from a 64 bit PowerPC register and has no link to 
real world time, it is used for processing the Event) 


ud64 







Figure 12 : OS Event Data Structure 
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An application will start receiving Events when it uses PowerTV APIs to "register interest" in a particular, or a set 
of particular Events. An application can start generating Events at any time by simply using the appropriate APIs. 
The Code field of the Event is what PTV uses your mask to check against any Events that are generated. A quick 
look at the way the Code field has been structured (above) should demonstrate that there are several ways the Code 
can be used to filter and route Events. When the App registers Event interest it defines an Event mask in terms of 
the Code field. The OS then uses this mask along with all others it has received to decide where an Event should be 
routed, and in which Event Queues it should be placed. When Apps register interest they also specify a DRAM 
memory location called an Event Queue which acts analagous to a data mailbox. If your application asks for a lot of 
Event types, (e.g. **Give me all Events generated by all external devices") you will receive alot of "mail" in your 
Event Queue memory locations, if you ask for only certain Events (e.g. 4< Give me an Event only when the Power 
button is pressed") , you will receive just sporadic Events in your "mailbox". 



3.5.2. Cllckstream Event type: w kEt_cllckStream": 

BellSouth has defined a unique Operating System Event Type for use in the BellSouth system, and has done so with 
PowerTV approval. Today this Event Type is defined as a "#define" in our application source, but will appear in the 
PTVTYPES.h file which PowerTV releases as part of their standard Operating System release in the near future. 
(This #define will only appear in releases of PowerTV for BellSouth use and 3rd party application developers for 
BellSouth.) 

Clicks Oram Kernel is initialized following Level 2 boot by the PTV OS Application Manager. During it's 
initialization process it registers interest with the PowerTV OS in any OS Event of the "kEt_clickStream" type. It 
is the only application which registers interest in this Event type. When an application with focus (or otherwise) 
reaches a point in code where an action worthy of jouraalling has occured, it places the contents of the Clicks tream 
Journalled event into a known area of DRAM. It then launches a kEt_clickStream type Event With a 
kEt_clickS tream Event the following syntax is used 



kEt clickStream Structure 


Contents 


Size/Format 


kEt clickStream 0 






{ 






Code 


0x00008000 


ui32 


X Clickstream Application ID 


See section 3.4.3 Fig. 5 for ID used 
by each Application. Fill with leading 
Zeros to be ui32. 


ui32 


Y Length of Data 


Number of dick Bytes at DRAM 
location Z. Also leading Zero stuffed. 


ui32 


Z Memory pointer to data 


♦DRAM pointer 


in32 


T Operating system will append the 


Application does not use this Held. 


iri64 


) , 







Figure 13 : Cllckstream Event Type Date Structure 



The Clickstream Kernel will receive the OS Event sent by the application. It will use the information provided t 
the X. Y, and Z parameters to grab the Clickstream data out of it's temporary location in DRAM and into the 
Clickstream double buffer area. The Clickstream Kernel will concatenate this Clickstream data at the end of the 
buffer it is currently maintaining. It performs a level of memory management on the double buffers to maintain 
current pointers of free memory. 
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3.5.3. Clicks tream API Source Code Example 
The following is a source code example of a miscellaneous application making a Clickstream joumalling call: 



I J *******G enera | Header c oc | e **************************»*** 
/ / Apps would place this first part of code once in a header area, 
/ /or make sure it was in a # include 

tf define CLICK / / make it easy to pull the code out at compile time 

#ifdef CLICK 

9 define kEt_clickStream 0x00008000 / / all apps who wish to journal use this #define, it will 

/ / live in the PTVTYPES.h file by end of year. 

ffdefine kEd_dickEntry 0 / / event data for normal click stream event 

#define CUCKBUFFER_SIZE 0x20 / / local storage of one or two clicks, this size will be 

/ / slightly application dependent, but 16 to 32 bytes 
/ / should be sufficient for most Applications. 

U define APPJD 0x0000 // Application Identifier should be defined here. 

// IDs for all applications are listed in Figure 3 



void *dptr; / / declare a pointer to local Click data storage 

} 

#endif 



// ** End of General Header Code* 



I J Code for each key c | ick occurences 

#ifdef CLICK / / make it easy to pull the code out at compile time 

dptr = pk_Galloc( CLICK BUFFER_SIZE ); //Application needs to grab some amount of 

/ /DRAM for temporary storage of Clicks locally. 
sprintf((ui8 *)dptr,"clickstream stuff); //place Click data into memory at the *dptr location. 
/ /This is an example of placing a string of ASCII text at this location. App developers will be 
responsible for writing code to place a data struct as outlined in Section 3 into memory at *dptr. 

pk PostEvent(kEt clickStream I kEdjrlickEntry, APPJD,(ui32)strlen(dptr),(ui32)dptr); 

J " 
tfendif 

I J Em j Q f c<K | e f or eac h (< e y occurances ♦♦*♦*♦*♦♦*«*«**♦***♦♦•♦****♦***♦*♦♦♦***♦*♦« 



Private/Proprietary to BellSouth Interactive Media 



21 



BellSouth Interactive Media Services 
CHckstream System Spec PRAFT vjnW 



Developer Note: 

The code associated with each Key Click will be responsible for Malloc'ing memory for each Click, and the 
Clicks tream Kernel will then De-allocate the 

memory assigned with every Click. Because of this it is imperative that the sending application check to make sure 
that the clicks tream application is up and running in background mode. The sending application can do tins by 
querying the Application Manager for an application by the name of "CUCKSTREAM". This call will also retain 
to the calling application the system event que of the dickstream Processor. If the Oickstream Processor is NOT 
running in the background to De-allocate this memory, there will be memory leaks associated with each attempted 
journalling call by the sending application. 
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3*5.4. CHckstream Upload 

Once a Clicks trcara buffer has been filled, or the Gickstream Kernel has decided an upload of data is appropriate for 
other reasons (timed upload, low system utilization, commanded upload, etc.) the buffer will be formatted, 
compressed, and then uploaded through the system to the Sybase Staging Server. 

The Gickstream upload takes place over the system messaging or Level 1 pass-through protocol. The data protocol 
is uni -directional UDP/IP and so there will be application level acknowledgements from the Staging Server during 
the upload process. The STB application will use the PowerTV system messaging APIs to send data packets 
upstream 




Level 1 Distribution Netwoik 



▼ 








CMC 1 








r \ 



(ESF) 



Ctictcstream 
Picket ACKs 



(Sk*te4 ALOHA) 



CGckstrcam 
Packets 



|99~ 



1 1 1 



□ 



STB 
-T 



I 



Figure 14: Upload Data Flow 
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3. 5. 4.1. System Upload Control I Load Distribution 

There will be several ways for the upload of Clickstreani information to be controlled. A broadcast transaction set 
will be used to set up the upload control method used, or uploads can be commanded by the system. In 
implementation, this means MKIS or Staging Server will source a set of transactions delivered over the Level 1 
pass-through messaging to perform this upload control. The data associated with each STB will be stored in Non- 
Volatile Memory. Because of the small footprint of the data, and the adverse effects that would be caused if the 
upload control data parameters were not present within the STB, this data will reside in NVM. 



3.5.4.1 .1 .STB Group Assignment for Clickstream Uploads 
In order to assign STBs to a uniformly distributed set of groups during the trial period, the 48 bit STB MAC address 
can be used to randomly distribute boxes within groups in the system. The Clickstream Kernel will use the least 
significant 5 bits of the MAC address to determine the Clickstream group in which it belongs. This will define 32 
groups within the system. This way a separate addressable transaction data packet for group assignment does not 
have to be defined, transmitted, and received during the trial period. This is a short term solution, and would have to 
be modified to an addressable group assignment methodology concurrent with large scale deployment 



STB MAC Address 



Byte 1 


Byte 2 


Byte 3 


Byte 4 


Byte 5 


B y 








7 6 5 




Figure 15: Clickstream Group Assignment 



3.5.4.1 .2. Upload Control Broadcast Message 



This one message depending on how it is configured with data will allow for the following control: 

1) Cyclic Upload Control to Groups 

2) Cyclic Upload Control to all STBs 

3) Upload on Command/Polling Control (addressable only) 

4) Suppression of Upload on buffer full 

5) Collection Control On/Off addressable 



TCP/IP Protocol Header 


Adaptation Type 


Protocol 
Discriminator 


40 Octet 

Destination: CMC 
Sender. VTE 


1 Octet 
0x02 = TCP 


I Octet 

(fixed for BLS Trial) 
0x80 






Adaptation Data 


Message Id 


Request Id 


Address Mode 


2 Octet 

Length (Octets to follow. 
Not including Trailer) 


I Octet 
(Fixed) 
0x60 


2 Octet 
Not Used 


1 Octet 

0x01 = Addressable 
0x04 = Broadcast 






Level 2-De fined > 
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Destination.Address 


L2_DataCount 


Data_Type 


VSP_Id 


6 Octet 

• MAC Address or 

• OxM-t-t-b = Broadcast 


2 Octet 

# Octets to Payload End 
Variable 


1 Octet 

0x00 = User Data 


1 Octet 
0x01 = BIMS 


Level 2-Dcfined > 



CUckstream Upload Control (Transaction.Code and Transaction JPayload) 


Octetf 


Contents 


T 0 


Transaction Code MSB = 0x80 


T 1 


Transaction Code LSB =0x10 


0 


Clicks tream Processor Version Number 


1 


Global 1 Addressable Flag 1 Group Address - Denotes the group of STBs to field this 
Flax (bl) 1 (bl) | transaction (b5) 


2 


Collection On/Off Key - Will turn Clicks trcam collection On/OfT to a STB or 

Group of STBs (non-Global only) 


3 


Perform Upload Now Key - Will perform. an upload on command. 

Will only upload on command if Global Has is NOT set 


4 


Suppress Upload on Buffer Full - Will keep the STB or Group from uploading when buffer is full. 

The STB or Group will only upload on it's appointed upload cycle. 


5 


Upload_Cycle_Definition - A STB will have 1 to 4 possible upload cycles defined. 

inis wm aeiine any one oi inosc cycles. 


© 


v-ycie rirsi vjccurrence oian, nme ^loiai ow) - rear 

I^vtlUva UK UulV UI UlV 111 31 UfJllMU 111 VjrVIC 


7 


Cycle First Occurrence Start Time - Month (b8) 


8 


Cycle First Occurrence Start Time - Day (b8) 


a 

* 


fvrJe Rr*t Or**nrr»»nr~ Qtnrt Tim« . Hour (hfC\ 


A 


Cycle First Occurrence Start Time - Minute (b8) 


B 


Cycle First Occurrence Start Time - Second (b8) 


C 


Upload Duration (Total b24) - Hours(0-24) (b8) 
Defines a duration of time over which the STB randomizes upload start time. 


D 


Upload Duration - Mimites(0-59) (b8) 


E 


Upload Duration Seconds(0-S9) (b8) 


F 


Cycle Time (Total h32) Days (0-14) (b8) 
Defines the periodicity between uploads. This is the mean time between uploads. 


10 


Cycle Time - Hours(0-24) (b8) 


11 


Cycle Time Miiiutes(0-59) (b8) 


12 


Cycle Time - Scconds(0-59) (b8) 


0x13 
to 

0x27 


reserved 



TCP/IP Protocol Trailer 



CRC Checksum, etc 



VTE Inserted > 

Figure 16: Upload Control Transaction Definition 
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Data Dictionary: 

The Qickstream upload cycle consists of several parameters which define a start time and a cycle on which the 
uploading of data happen. The "first occurrence" defines a starting time in history from which the cycle runs. The 
"cycle time'* defines the amount of time which elapses between periods of the upload cycle. When a cycle is 
complete the **upload duration * time starts, and the STB Qickstream Processor will randomize an exact upload time 
within the Upload Duration. This will distribute the network load relatively evenly over the entire Upload Duration 
period. 









Upload of 
(fata occurs 








Upload of 
(fata occurs 












Cycle Time / 1 


r 

CydeTime . 1 



Cycle First Occurance Start Time 



Time 



Figure 17: Upload Cycle Diagram 



An example of the use of these parameters would be to define a period of time every day for STBs to upload data. 
BellSouth has a requirement to perform analysis of uploaded data on a daily basis. This should be available every 
morning towards the beginning of the work day. Peak usage of broadcast Prime Time and of Interactive services will 
typicly be from 7pm until Midnight. This would be a time period when uploads should be inhibited to limit the 
burden on the Level 1 distribution network, which will be busy with exclusive session setups and tear-downs. 
Beginning at Midnight, Qickstream uploads would begin. In order to have all STBs upload before 8am, and given 
the number of upload groups within the system to be 32, each group would perform uploads over a 15 minute 
period. To achieve this upload cycle: 

Cyde_Hrat_Occurance.Start_Time = 12:00am Jan 1, 1995 + "X" * 15 minutes. 

Each upload group would have this parameter staggered by 15 minutes. 
Cycle Time = 24 hours 

Upload Duration = 15 minutes 



A total of 4 upload cycles will be able to be defined for each group of STBs. This would allow for weekly uploads, 
or any other combination of cycles to work around peak LI load times. 
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3. 5. 4. 2. Communications Procedures for Upload 

The STB will initiate the upload process. When the STB has determined it is appropriate to perform an upload of 
Clicsktream data (for any of the reasons covered in section 3.5.4), it will lock the DRAM buffer actively being 
filled, it will compress the buffer using the LZW compression utility, and it will perform the upload of the 
Clickstream buffer over the LI Pass-Through network. 

The STB initiates reverse path LI Pass-Through by calling the "sess_MessageRequesr PowerTV API. Within the 
parameters of this API are the addressing Mode, the destination address, the size of the "message" packet to send, and 
a pointer to the data in memory to send upstream. 

ix sess.MessageRequest ( u(8 mode, ul8 address[6], ull< sp_count, ul8 *sp_data); 

From this point, the PowerTV operating system and lower level drivers will handle the uploading of the message, 
including the insertion of the UDP/IP protocol layer and CRC-32 transport trailer. 

There WILL be a limit to the size of the data allowed in a single reverse path LI Pass-Through transaction. Today, 
this is not explicit in PowerTV documentation, but in disclosure documentation it is stated as 252 Octets. 
Because most Clicks treara uploads will require more than this, the Clickstream Processor will be making sequential 
calls to this MessageRequest API to complete a single upload process. 

There are several different addressing modes available in LI Pass-Through. The one that Cickstream Processor will 
be using will identify a VSP as the destination of these data packets. That address_mode is 0x03. The address 
following this addiess_mode is a 2 Octet SPJD (short for VSP identifier). It should be right justified and 0 filled 
into the 6 byte field available. The sp.count and *sp_data are simply the size and pointer to the data to be uploaded. 
All of the data to be uploaded appears as "Payload" to the STB, the signalling network, the CMC, and the Event 
Capture mechanism on the Staging Server back end Level 2 system. 
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IP Header 


UDP Header 


Adaptation Type 


Protocol 
Discriminator 


20 Octet 

Destination: CMC 
As defined in RFC 791 


8 Octet 

As defined in RFC 768 


1 Octet 
0x01 = UDP 


1 Octet 

(Fixed for BLS Trial) 
0x80 


PowerTV OS inserted > 






Adaptation Data 


Message.Id 


Requested 


Address.Mode 


2 Octet 
0x0000 or 
sequential counter 


1 Octet 
(Fixed) 
0x60 


2 Octet 
NotUsed 


1 Octet 

Address of VSP 
0x03 = "SPJD" 


PowerTV OS inserted > 


Application-Defined> 






Destination_Address 


L2_DataCount 


IP Address of AS 


Service ID Length 


6 Octet 

•Address of VSP 


2 Octet 

# Octets to Payload End 
Variable 


4 Octet 
(Fixed) 

• IP Address of App Server 


4 Octet 
TBD 


Application-Defined > 






Service ID Value 


STB MAC Address 


Data_Type 


VSP_Id 


4 Octet 
TBD 


6 Octet 

• Hardware Address of STB 


1 Octet 

0x00 = User Data 


1 Octet 
0x01 = BIMS 


Application-Defined > 



CUckstream Upload Data Packet (Transaction.Code and Transactlon_Payload) 


Octet* 


Contents 


T 0 


Transaction Code MSB = 0x80 


T 1 


Transaction Code LSB =0x18 


0 


CUckstream Processor Version Number 


1 


Upload Sequence Number 


0x02 

thru 

OxFA 


CUckstream Upload Buffer Data Structure (as outlined in Figure 3 ) 

The data structure is broken up into as many reverse path transactions as are neccessary to perform 
the complete upload of data. 



UDP Transnort Trailer 


Control 


Length 


CRC-32 


2 Octet 


2 Octet : # Octets from 


4 Octet 


(fixed) 


IP Header to Payload End 




OxH-FF 


Variable Length 


CRC-32 Calculation 


PowerTV OS inserted 







Figure 18 : Upload Transaction Definition 
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Figure 19: Upload Data Flow 2 



3.5.4 .3. System Acknowledgements on Clickstream Buffer Uploads 
The Event Capture Procedure running on Staging Server will be responsible for formulating and sending some level 
of data acknowledgements to each STB engaged in the upload process. The details of the acknowledgement process 
is still a work-in-progress. The acknowledgements will go out as addressable downstream Level I Pass-Through 
transactions. They will enter the STB through the UDP/IP protocol stack, and be handed to the Clickstream 
Processor as a PowerTV Event which the Processor has registered an interest in. 
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TCP/IP Protocol Header 


Adaptation Type 


Protocol 
Discriminator 


40 Octet 

Destination: CMC 
Sender. VTE 


1 Octet 
0x02 = TCP 


1 Octet 

(Fixed for BLS Trial) 
0x80 


VTE inserted > 




Adaptation Data 


Message.Id 


Request Id 


Address_Mode 


2 Octet 

Length (Octets to follow. 


1 Octet 
(Fixed) 
0x60 


2 Octet 
NotUsed 


1 Octet 

0x01 = Addressable 


VTE Inserted > 


LctcI 2-De fined > 




Destination Address 


L2 DataCount 


Data_Type 


VSP Id 


6 Octet 

•MAC Address 


2 Octet 

# Octets to Payload End 
Variable 


1 Octet 

0x00 = User Data 


1 Octet 
0x01 = BIMS 






Transactlon_Code 


Transaction.Payload 


2 Octet 

Defines TX_Payload 
O18OIF 


Data Acknowledgement Field (TBD) 
Application Level Data 





TCP/IP Protocol Trailer 



CRC Checksum, etc 



VTE Inserted > 

Figure 20 : Acknowledgement Transaction Defined (As Sent by Staging Serrer) 
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3.5.4.3.1. UOP/IP Protocol Header 



The UDPflP protocol stack within the STB will be handling insertion of the UDP/IP headers and transport trailer for 
LI Pass-Through transactions transmitted upstream. The header is directly from the RFC 791 and RFC 768 
specifications. They are as follows: 



IP Header 


IP Version 1 Header Length I Type of Service 


Total Length 


Identification 


Flags I Fragment Offset 


Time to Live 1 Protocol 


Header Checksum 


Source IP Address 


Destination IP Address 




LJDP Header 


Source Port 


Destination Port 


Length 


Checksum 



Figure 21 : IP and UDP Header Definitions 



3. 5. 4. 3. 2, Data Compression on Upload 

The data compression algorithm LZW has been selected by PowerTV as a utility they will be providing to 
applications from the Operating System at some point in the future. They will be using LZW primarily for 
decompression of large BLOBs , graphics Tdes, data objects, etc. The Clickstream processor will also use this 
algorithm to compress the Gickstream buffer before data is uploaded through the system. 

The LZW Compression algorithm will be treated in detail in the next release of this document. 

3.5.4.3.3. Level 1 Pass-Through Mechanism to IMS 

This is currendy a work in progress. APIs have been denned but have not been delivered to BellSouth. 
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4. IMS Server Functions 

4.1. IMS So arced Logs 

The IMS will be the source for content Metadata regarding all session based services, and in addition will act as a 
subset of the Gickstream information gathered on the session based services offered. 

4.2. Pass Through on CUckstream 

This interface does little more than act as a gateway for the reverse path LI Pass-Through Transactions to flow. It 
does not manipulate the contents of the LI Pass-Through Transaction. The IMS will wait on an open connection to 
the "Event Capture'* process running on the Staging server, and will pass CUckstream data over that connection as it 
is received The interface to LI Pass-Through on the Staging server side is called by the following function call. 

Int IMSJULl.PassJThrough ( Int Size, VarBInary Payload); 
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5. Metadata Formats 

Content Metadata is information in an encoded form which describes the programming shown on the network and the 
time and broadcast or cable network on which the programming was shown. In the term "Content** we are including 
programming of virtually any type which is transmitted to the subscriber's Set-Top Box (STB). There are a number 
of sources for this information and it will be of utmost importance to manage this information appropriately. For 
this system, we will be interested in the merging of Content Metadata into our Clickstream data primarily for the 
analysis of Broadcast services provided by the Cable Television Application (Cable App) and VOD/NVOD usage. 
We will also be interested in the numbers of STBs tuned to other interactive services when analyzing ratings and 
general viewing habits of our subscriber population. The merging of Content Metadata into our Clickstream data 
adds the relevant information upon which most of our analyses will be based. 

The sources of Content Metadata for this system are shaping up to be: 

1) EPG System: General broadcast programming content. 

2) Broadcast Advertising: Advertising content occurrences on most national f local, and cable 
broadcast networks. This will not include some spot advertisements generated by local broadcasters, and will not 
include any spot advertisements generated within the BellSouth system. 

3) IMS System Logs : The IMS system will Log all stream occurrences, and the STB or STBs 
associated with the stream. This will primarily be in the form of VOD, NVOD, and interactive application 
execution (i.e. the Navigator). 

4) Advertising Traffic Controller: This system is still in very formative stages at the time^>C-- 
this writing. It will be a BellSouth system enabling the control of advertisement traffic through the BellSouth 
system throughout the interactive video trial. This system will hand off advertisement Metadata which is generated 
in-house by the insertion of advertisements in both broadcast and session-based services. 

There will also be system globally defined Metadata. There are lookup tables by which all sorts of other contextual 
information may be determined. These tables should be global to the BellSouth System were possible. Many, if 
not all of these tables are defined in this section. 

The Clickstream system will attempt to resolve data formats coming from the various sources by defining a global 
data storage format in the MKIS system, and resolving any variations from that global format with the Metadata 
providers before the system comes on-line. If all variations cannot be resolved at their sources, an additional parsing 
mechanism will have to be implemented. 

In addition, there will be Content which is reported by several different sources. We will have to resolve these 
"Duplicate Occurrences" for our Content data to be as accurate as possible. 



5.2. Globally Defined Metadata Formats 

Content Metadata is logged in a single main tabular form. The Metadata IDs are then resolved by several supporting 
data tables. In the MKIS system this data structure will take the form of a relational database model. 



5.1.1. Global Event ID Definitions 

A system global method for defining Event IDs is needed by every application to provide a context by which the 
event was important enough to be joumaled. This could be as simple as a broadcast channel change by pressing the 
Chan Up key. It could be a passive change in the content shown on the network. All of these event types are 
gathered into a single table for purposes of easing the data analysis process. No single application will make use of 
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every possible event type. In most cases it will benefit the system overall for applications to make use of as little 



EVENT DEFINITIONS 



Content Related Events 

0x0000 | Passive Change in Content 



Direct Key Presses 



0x0001 


TVoiTV Pressed 


0x0002 


Power Pressed 


0x0003 


One (1) Pressed 


0x0004 


Two (2) Pressed 


0x0005 


Three (3) Pressed 


0x0006 


Four (4) Pressed 


0x0007 


Five (5) Pressed 


0x0008 


Six (6) Pressed 


0x0009 


Seven (7) Pressed 


OxOOOA 


Eght (8) Pressed 


OxOOOB 


Nine (9) Pressed 


OxOOOC 


Zero (0) Pressed 


0x0000 


Channel Up Pressed 


OxOOOE 


Channel Down Pressed 


OxOOOF 


Volume Up Pressed 


0x0010 


Volume Down Pressed 


0x0011 


Last Channel Pressed 


0x0012 


Favorite Channel Pressed 


0x0013 


Guide Key Pressed 


0x0014 


Theme Key Pressed 


0x0015 


Display Key ftessed 


0x0016 


Mute Key Pressed 


0x0017 


A Key Pressed 


0x0018 


B Key Pressed 


0x0019 


C Key Pressed 


OxOOlA 


Info Key Pressed 


OxOOlB 


Backup Key Pressed 


OxOOiC 


Main Menu Key Pressed 


OxOOlD 


Up Arrow Key Pressed 


OxOOlE 


Down Arrow Key Pressed 


OxOOlF 


Rieht Arrow Key Pressed 


0x0020 


Left Arrow Key Pressed 


0x0021 


Enter Key Pressed 



0x0022 


Play Pressed 


0x0023 


Rewind Pressed 


0x0024 


Pause Pressed 


0x0025 


Record Pressed 


0x0026 


Fast Forward ftessed 


0x0027 


Stop Pressed 


Application/State Switching Related 


0x0028 


AC Power ON 


0x0029 


Application Switch (Normal) 


0x002A 


Application Switch (Abnormal) 


0x002B 


Application Terminated (Normal) 


0x002C 


Application Terminated (Abnormal) 


0x002D 


Soft Power OFF 


0x002E 


Soft Power ON 


0x0O2F 


OFF State Polling Event 


General 


0x0030 


Direct Channel Change 


0x0031 


Mute 


0x0032 


Un-Mute 


0x0033 


Volume Change Below 50% 


0x0034 


Volume Change Below 25% 


0x0035 


Volume Change Below 10% 


0x0036 


Volume Change Above 50% 


0x0037 


Volume Change Above 25% 


0x0038 


Volume Change Above 10% 


0x0039 


Change to Interactive Mode 


0x003A 


Change to Broadcast Mode 




Figure 22: Global Event ID Definitions 
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5*1.2* Timestamp Format 

The timestamp format for all Qickstream events throughout the system will be based on a 6 byte (48 bit) 
subset of the ANSI C and C++ standard TIME.T data structure. It will identify a unique time since 
the year 1900 down to a one second resolution. Some of our content Metadata providers will only be able to give ua 
data down to a one minute resolution, if this is the case, these Metadata records will have to be appended or modified 
defaulting the "second" field to 00. The following is an overview of the TIME_T data structure. 





<ttme.h> 






int 


tm sec 


seconds after the minute 


(0,61) 


int 


tm min 


minutes after the hour 


(0.59) 


int 


tmjiour 


hours since midnight 


(0*23) 


int 


tm mday 


day of the month 


(1x31) 


int 


tm_mon 


months since January 


(0r«) 


int 


tm_year 


years since 1900 


(0, 256) 



Figure 23: Global Time Format 



not used int tm_wday days since Sunday (0,6) 

not used int tm_yday days since January (0356) 

not used int tmjsdst Daylight Savings Time Rag 

The entire data structure as defined in the ANSI C spec is a full 9 Bytes. The last 3 bytes deal with defining the day 
of the week, the day of the year, and a daylight savings time flag. These could very easily be extrapolated after event 
generation in the MKIS system if needed, and it is not clear that they would be needed for any of the known 
analyses. The elimination of these three bytes from the Qickstream event decreases storage and bandwidth needs. 



5.1*3* Global Broadcast Channel Identifiers 

The following table defines a coding scheme for national and local broadcasters which will be carried on the 
BellSouth network during the Residential Broadband Trial. This list may grow as channels are added, but the 
identifiers will remain unique and constant for each channel. Any application joumaling off events which occur 
while subscribers are viewing broadcast television will be able to identify the network carrying the programming 
content by using a subset of this table. In this way channel lineups can be changed over the life of the residential 
broadband trial, (and presumably a larger scale BellSouth services deployment), and the identifier for a network 
would stay the same. Data repositories will be relieved of the task of mapping an ever-changing channel number i 
a network. 
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Broadcast Channel Identification Table 



0x0100 to 



0x01 1 F 



ews/Ta I k Shows 



10x0100 



CNN 



10x0101 



10x0102 



The Weather Channel 



10x0103 



CNBC 



10x0104 



CSPAN 



10x0105 



CSPAN-2 



10x0106 



America's Talking 



10x0107 



10x0108 



Court TV 



0x0109 



The Crime Channel 



I 0x0 10 A 



National Empowerment TV 



0x0120 to 



I0X013F 



Sports 



10x0120 



10x0121 



10x0122 



10x0123 



10x0124 



10x0125 



deadline News 



alk Channel 



SPN 



SPN-2 



SpottSouth 



The Go If Channel 



Classic Sports Network 



'rime Network 



0x0126 


NewSpoit 


|0x0140 to 




0x01SF 


Music 


0x0140 


MTV 


loxOWl 


VH-1 


lox0142 


Country Music Television 


lox0143 


The Nashville Network 


10x0144 


The Box 


lox0145 


Video Jukebox 


lox0146 


MOR Music TV 


Iox0l47 


Musk Choice 


|0x0160 to 




I0X017F 


Shopping 


lox0160 


QVC 


10x0161 


QVC-2 


10x0162 


Home Shopping Network 


10x0163 


TV Macy's 


|0x0164 


Catalog 1 


lox0165 


S, The Shopping Network 


|0x0166 


Cupid Network 


|0x0180 to 




lOxOIQF 


Movies 



10x0180 



Turner Classic Movies 



10x0181 



American Movie Classics 



10x0182 



TNT 



10x0183 



I Popcorn Channel 



10x01 AO to 



1 0x01 BF Religious 



10x01 AO 



Faith & Values Channel 



10x01 Al 



1 The Inspirational Network 



10x0 1A2 



|0x01A3 



Trinity Broadcasting Network 



I Eternal World TV Network 



10x01 A4 



| The Gospel Network 



10x01 CO to 



|0x01DF 



[Health & Self- 
llmprovement 



lOxOICO 



I Lifetime 



lOxOlCl 



Cable Health Club 



0x0 1C2 



The Health Channel 



|0x01C3 



Parent Television 



10x0 1C4 | Recovery Network / The Wellness 
1 Channel — 



0x01 E0 to 



10x01 FF 



ICultural/Ethnlc 



lOxOlEO 



IA&E 



lOxOlEl 



BRAVO 



10x01 E2 



I El 



10x01 E3 



Ovation 



|0x01E4 



BET 



|0x01E5 



I BET International 



|0x01E6 



BET on Jazz 



|0xOlE7 



I World African Network 



10x01 ES 



Unrvision 



|0xOlE9 



iGalavision 



lOxOlEA 



iTelemundo 



lOxOlEB 



GEMS 



lOxOlEC 



I intpmatipnal Channel 



10x0200 to 



10x021 F 



I Educational 



10x0200 



1 The Discovery Channel 



10x0201 



| The HistoryChannel 



10x0202 



| The Learning Channel 



10x0203 



1 Mind Extension University 



10x0220 to 



I0X023F 



iKIds 



10x0220 



Nkkelodean 



10x0221 



I Cartoon Network 



10x0240 to 
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OX025F < 


Seneral 


0x0240 


TBS 


0x0241 


The Family Channel 


0x0242 1 


JSA Network 


0x0243 I 


FV 


UXl/Z*H* 




UXUX**J 


ii/u/nn (kty\ 

W Yr\JI\ yn I ) 


UXUZ40 


w rlA ^fN I J 


UXUZ** / 




UXUZ45 


fcTTVT 
N.1 V 1 


0x0260 to 




0X027F 


Specialty 


0x0260 


Sci-Fi Channel 


0x0261 


Comedy Central 


0x0262 


Sega Channel 


0x0263 


Nostalgia Television 


0x0264 


Americana Television Network 


0x0265 


The Collector's Channel 


0x0266 


TV Food Network 


0x0267 


The Travel Channel 


0x0268 


ones Computer Network 


0x0269 


The Game Show Channel 


0x026A 


The Ecology Channel 


0x026B 


Home & Garden TV 


0x026C 


Kaleidoscope: America's Disability 
Channel 


UXUzoLJ 


l ne rvuiiiaiy v~naniici 


UXUzofc 


l ne navy v^naiinei 


UxUzor 


1 ne oingies v_naiuiei 


0x0270 


i ne ivtusician unannei 


0x0271 


Romance Classics 


0x0280 to 




0X02AF 


Premium Channels 


0x0280 


HBO 1 


0x0281 


HBO 2 


0x0282 


HB03 


0x0283 


Cine max 1 


0x0284 


Cinemax 2 


0x0283 


The Movie Channel 


0x0286 


Showtime 1 


0x0287 


Showtime 2 


0x0288 


Showtime Espanol 


0x0289 


Showtime Family 


0x028A 


Showtime Film 


0x028B | Showtime Comedy 



0x028C 


Showtime Action 


0x028D 


The Disney Channel 


0x028E 


Adam & Eve 


0x028F 


Spice 


0x0290 


Spice 2 


0x0291 


' Tieatre Vision 


0x0292 


Playboy 


UXUZVJ 


rvequcsi l v 


IXUZV4 


viewers no ice 


UXUZVO 


tout \_noice i v 


UXUZV / 


/^mUU V/i/Iaa Qtnr» 

v^aoie vioeo oiorc 


0x0300 to 




0x033F 


Local Broadcast 
Providers 


0x0300 


WSB 


0x0301 


WAGA 


0x0302 


WGTV. 


0x0303 


WXIA 


0x0104 

WAWJVt 


WTBS 


0x0305 - 


WPBA 


0x0306 


WATL 


0x0307 


WGNX 


0x0308 


WVEU 






ftvFFFO to 




OxFFFF 


Corner Cases / 
Error Conditions 


OxtH-hO 


Provider To Be Defined 




Provider Unknown 


OxFFFFF 


General Error 
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5.2. Content Metadata Formats 

Metadata regarding programming content provided in the BellSouth system is necessary to tie events which occur on 
the STB to what content the event may have been linked with. "How many subscribers changed the channel during 
every Coca-Cola commercial 7\ "Do subscribers channel surf during some programs more than others?*, etc. All 
possible queries against our usage data will be enabled only by our ability to tie content to the services provided. 
The following is a general format for all content Metadata. A main table (table 1) ties a piece of progr ammi ng 
content to a particular channel identifier between an accurate start and end time. The table 2 simply supports the ID 
scheme used in table 1 to minimize data storage and transmission capacities. Other Channel and Time encoding 
schemes are defined in section 5. 1 above. 





Content ID 


Channel ID 


Start Time 


End Time 


Examples 


(MH2345FF 


0x002F 


OxOOOOFEDCBAOO 


OxOO00FEDCBF98 




0x01234567 


0x002F 


0xOOO0FEDCBA98 


OxOOOOFEDCBFFF 








Table 1 





Content ID 


Content Unique Descriptor 


Content Type 


Examples 


0x01234568 


"MURPHY BROWN, #39" 


0x04 




0x01234567 


"HERBIE RIDES AGAIN" 


OxOF 



Table 2 





Content Type 


Content Type Descriptor 


Examples 


0x04 


"Situation Comedy" 


0x05 


44 Movie, Drama" 




0x06 


"A^ertisement" 


OxOF 


4< Movie, Comedy" 



Table 3 



Global Content Data Formats: 

Content ID : 32 Bit unique identifier for content 

Channel ID : 16 Bit unique identifier for programming network. 

Start Time: 48 Bit standard C / C++ * r nME_T* data format for description of time. 

End Time: 48 Bit standard C / C++ *TIME_T* data format for description of time. 

Content Unique Descriptor: Variable length character string describing programming content. 
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5.2.1. Electronic Program Guide Metadata Format 

This is an example of the types of data we will expect to be sourced by Prevue Guide during the Residential 
Broadband Trial period. Data formats can be expected to be more complex than that represented here. These data 
formats are expected to finalize over the next month. 

Prevue Guide Metadata is replicated forward from the Prevue Guide server to the Staging Server through a Client / 
Server replication RPC 





Content ID 


Channel ID 


Start Time 


End Time 


Examples 


CK012345FF 


0x002F 


OxOOOOFEDCBAOO 


0x0000FEDCBF98 


0x01234567 


0x002F 


0x0000FEDCBA98 


OxOOOOFEDCBFFF 



Table 1 





Content ID 


Content Unique Descriptor 


Content Type 


Examples 


0x01234568 


"MURPHY BROWN. «9" 


0x04 




0x01234567 


"HERBDE RIDES AGAIN" 


OxOF 



Table 2 



Electronic Program Guide Content Data Formats: 

Content ID : 32 Bit unique identifier for content 

Channel ID: 1 6 Bit unique identifier for programming network. 

Start Time: 48 Bit standard C / C++ 'TIMEJF data format for description of time. 

End Time: 48 Bit standard C / C++ *TIME_T* data format for description of time. 



5*2*2. Advertisement Metadata Format: 

The format of advertisement data to feed into the MKIS would take the form of two tables. The first table would be 
the rime/channel grid relating an advertisement identifier to a channel ID at a particular start and end time. The 
second would be a table relating advertisement identifiers to the actual advertisement content as described in a 
character string, the advertisement identifiers would describe an advertisement content for the life of the system,(i.e. 
IDs would not be recycled). The reporting agency would use our existing Channel ID metadata, and any other 
BellSouth standard encoding scheme for their content where possible. 
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Advertisement ID 


Channel ID 


Start Time 


End Time 


Example 


0x01234567 


0x002F 


0x0000FEDCBA98 


OxOOOOFEDCBAFF 



Table 1 





Advertisement ID 


Advertisement Unique Descriptor 


Example 


0x01234567 


"Coca Cola Corporation Ad # 1357" 



Table 2 



Advertising Data Formats: 

Advertisement ID : 32 Bit unique identifier for advertising content. 

Channel ID : 16 Bit unique identifier for programming network. 

Start Time: 32 Bit standard C and C++ 'TIME^' data format for description of time. 

End Time: 32 Bit standard C and C++ *TIME_T' data format for description of time. 

Advertisement Unique Descriptor Variable length character string describing advertisement 



5.2*3. IMS Log Metadata Formats 



5.2.4. BellSouth Advertising Traffic Control Metadata Formats 

The format of in-house advertisement data to feed into the MKIS would be identical to third -party sourced data. 





Advertisement ID 


Channel ID 


Start Time 


End Time 


Example 


0x01234567 


0x002F 


0x0000FEDCBA98 


OxOOOOFEDCBAFF 



Table 1 





Advertisement ID 


Advertisement Unique Descriptor 


Example 


0x01234567 


**Coca Cola Corporation Ad #1357' 



Table 2 



Advertising Data Formats: 

Advertisement ID : 32 Bit unique identifier for advertising content 

Channel ID : 16 Bit unique identifier for programming network. 

Start Time: 32 Bit standard C and C++ 'TIME JP data format for description of time. 

End Tune: 32 Bit standard C and C++ 'TIME JP data format for description of time. 

Advertisement Unique Descriptor Variable length character string describing advertisement 
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6. Staging Server Functions 

The Staging Server is a Sybase Conceptual Database entity which will serve many purposes in the trial. The main 
purpose in it's conception is a method to distribute many storage and communications processes off-line from the 
IMS which will be very busy with processor-intensive and time-critical actions controlling the VTE and handling 
signaling sessions. The actual hardware on which this system will reside may depend on performance we experience 
when the system comes on-line. In-depth documentation on the many functions of the Staging Server will be 
handled by Sybase eventually when these designs come to a state of completion. This section will act as a rough 
overview of the Staging Server processes involved in the Clicks tream System. 

6.1. Upload Communications Procedure: Event Capture 
6.1 A. RPC Endpoint 

The Staging Server will act as the endpoint of the initial upload procedure of STB Clickstream information. When 
IMS receives a Level 1 Pass-Through packet with Clickstream data, the packet will be addressed to a particular 
process running on the Staging Server. This "Event Capture" prococess rung on the Staging Server will maintain 
an open connection with the Level 1 Pass-Through running on IMS. * 

6.2. Persistent Storage 

The Event Capture process will provide for flat-file storage of da as is is delivered to the Staging Server over 

upstream communications. Data will then be decompressed and parsed from this flat file. 

6.3. Content Merge with Metadata 

Clickstream data must be brought together with contextual information on programming and advertising from both 
broadcast and interactive services. If this is done so in an effective manner, then analyses on the resultant data will 
provide an enormous amount of value to BellSouth. The processes of merging the data could be done in a number of 
different ways. The method advocated in this design document reflects a consideration for availability and format of 
data, system band widths, processing capabilities, storage requirements, and analysis capabilities. 
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BellSouth MKIS Database 



Raw data and Resultant Data 
far storage / retreival / ofHoad 



Figure 24 : Staging Server Overview 



The merging methods break down into three parts. ^ . 

1) Rrst a complete timeline is constructed for each Set-Top Box for an entire 24 hour "Day" period. Tins is 
done by inserting content identifiers into the CUckstream data, and by generating additional CUckstream data when 
content has changed and the subscriber continued to view the same programming channel or service. 

2) filtering is done to "weed out" any extraneous data. This part of the merging process will have to evolve 
over time as we learn more about the types of data which have proved to be valuable, and those which haven L 
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3) The timeline is further analyzed to provide a second timeline. This second data output is a list of content 
items "watched" by the STB and a list of content items "viewed" by the STB. These lists are generated based oq 
decision criteria provided by BellSouth Marketing and Advertisement departments. 



These steps are repeated for each Set-Top Box in the system. 



Output 



Clickstream 
Data 




Prevue Guide 
Data 



Broadcast 

Advertising 

Data 



Session-Services 
Advertising 
Data 



Session-Services 
Programming 
Data (IMS) 




'•Value-Added' 

Clickstream 

Timeline 



Merge 

& 
Parse 
Engine 



"View" and 

'Watch" 

Lists 



"View" and 
•Watch" 
Criteria 



'Value-Added 
Timeline 



Figure 25: Merging & Parsing Overview 



Merging and parsing should work on 24 hour segments of data at once to minimize the occurrence of un resolved 
events. This is to say that Clickstream Events are in and of themselves just pieces, and to do analyses on just 
several hours of data will leave the determination of programs being "watched" and "viewed" at the endpoints 
unresolved. 
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CHckstream Data 





Timestamp 


Application ID 


# Bytes to 
Follow 


Application Specific Data 
Event ID Channel ID 


Event Record 


3:59:30pm 


Cable App. 


4 bytes 


Soft Power On 


ABC 


Event Record 


4:00:00pm 


Cable App. 


4 bytes 


Channel Up 


NBC 


Event Record 


4:0£17pm 


Cable App. 


4 bytes 


Channel Up 


TBS 


Event Record 


4:06:25pm 


Cable App. 


4 bytes 


Channel Dwn 


NBC 


Event Record 


4:15:45pm 


Cable App. 


4 bytes 


Channel Up 


TBS 


Event Record 


4:55:45pm 


Cable App. 


4 bytes 


Soft Power Off 


None 



PreTue Guide Data 



2 





Content ID 


Channel ID 


Start Time 


End Time 


Content Record 


National News 


ABC 


3:30:00pm 


4:00:00pm 


Content Record 


Murphy Brown 


NBC 


3:59:00pm 


4:30:00pm 


Content Record 


Simpsons 


NBC 


4:30:00pm 


4:59:00pm 


Content Record 


N.G.E\plorer 


TBS 


3:05:00pm 


4:05:00pm 


Content Record 


Andy Grifith 


TBS 


4:05:00pm 


4:35:00pm 


Content Record 


NBA Basketball 


TBS 


4:35:00pm 


6:05:00pm 



Broadcast Advertising Data 



Content Record 



Content Record 



Content Record 



Content Record 



Content ID 



Coca Cola #10 



Visa #2 



Delta #1 



Delta #3 



Channel ID Start Time End Time 



NBC 



NBC 



TBS 



4:03:30pm 



4:04:00pm 



7 

< T Content Record | Visa #21 | TBS I 4:04:30pm I 4:05:00pm 



TBS 



4.00:30pm 



4:01:00pm 



4:04:00pm 



4:01:00pm 



4:01:30pm 



4:04:30pm 



otnbined Data 




-Parse and 

Merge 

Data 



* 



X 



^>U faerie 


Timestamp 


App* ID 


# Bytes to 
Follow 


Application Specific Data T 
Event ID Channel ID Content ID 


Event Record 


3:59:30pm 


Cable App. 


6 bytes 


Soft Power On 


ABC 


National News 


Event Record 


4:00:00pm 


Cable App. 


6 bytes 


Channel Up 


NBC 


Minphy Brown 


Event Record 


4:00:30pm 


Cable App. 


6 bytes 


Change Content 


NBC 


Coca Cola #10 


Event Record 


4:01:00pm 


Cable App. 


6 bytes 


Chance Content 


NBC 


Visa#T 


Event Record 


4:01:30pm 


Cable App. 


6 bytes 


Change Content 


NBC 


Minphy Brown 


Event Record 


4:03: 17pm 


Cable App. 


6 bytes 


Channel Up 


TBS 


N.G. Explorer 


Event Record 


4:03:30pm 


Cable App. 


6 bytes 


Change Content 


TBS 


Delta #1 


Event Record 


4:04:00pm 


Cable App. 


6 bytes 


Change Content 


TBS 


Delta #3 


Event Record 


4:04:30pm 


Cable App. 


6 bytes 


Change Content 


TBS 


Visa #21 


Event Record 




Cable App. 


6 bytes 


Change Content 


TBS 


■ N.G.Explorer 


Event Record 


4:05:00pm 


Cable App. 


6 bytes 


Change Content 


TBS 


Andy Grifith 


) Event Record 


4:06:25pm 


Cable App. 


6 bytes 


Channel Dwn 


NBC 


Minphy Brown 


Event Record 


4:15:45pm 


Cable App. 


6 bytes 


Channel Up 


TBS 


Andy Grifith 


Event Record 


4:35:00pm 


Cable App. 


6 bytes 


Change Content 


TBS 


NBA Basketball 


'Event Record 


4:55:45pm 


Cable App. 


6 bytes 


Soft Power Off 


None 
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7. Clickstream Sizing Estimates 

Two sets of numbers were put together to start to get a handle on possible data rates and storage needs for the 
network, and the back end systems to support the Clickstream system. 

7.1. Maximum Sizing Estimates 



This is a maximum possible approach to understand the absolute upper limit on clickstream bit rates, and storage 
capacities during the trial period. 

For the first analysis every STB in the system is being used 24hours a day, 7 days a week, and someone is changing 
broadcast channels once a second. This is an extreme scenario, but one which should present us with absolute 
maximum data and storage rates for the Clickstream system. 

Assumptions: • Maximum possible click rate after STB filtering = 

1 click per second uniformly distributed over 24hr x 7days . 

• 4000 subsribers for the trial period. 

• Uniform upload process through ap pr opri ate control mechanisms. 

• Average number of bytes per click = 14 uncompressed 

• Compression rate of 50% can be acheived for uploads. 

• 70 Broadcast channels in system 
Conclusions: • Max number bits/second uploaded through system: 

224,000 bits/second for system 
7,000 bits/second per RT 

28,000 bits/second per Slotted- Aloha Modulator 

( 18% of gross bandwidth, about 36 % of net bandwidth) 

• Max number bytes of raw data to be stored per day: 

• Clickstream: 4.838 Gigabytes per day 

• Content Metadata: 2 Megabytes per day 

7.2. Reasonable Sizing Estimates 
For the second analysis, every STB in the system is being used 24 hours a day, 7 days a week, and someone is 
changing the broadcast channel once a minute. This is a more reasonable scenario in that during peak "channel 
Surfing" data rates are likely to be higher on a per STB basis. This is balanced with the fact that neither duration of 
viewing or overall system viewship rates are expected to every be this high. 

Assumptions: • Reasonable click rate after STB filtering = 

1 click per minnte uniformly distributed over 24hr x 7days 

• 4000 subsribers for the trial period. 

• Uniform upload process through appropriate control mechanisms. 

• Average number of bytes per click = 14 uncompressed 

• Compression rate of 50% can be acheived for uploads. 

• 70 Broadcast channels in system 
Conclusions: • Max number bits/second uploaded through system: 

3,733 bits/second for system 
117 bits/second per RT 

447 bits/second per Slotted-Aloha Modulator 

(.03% of gross bandwidth, about .06% of net bandwidth) 

• Max number bytes of raw data to be stored per day: 

• Clickstream: 80 Megabytes per day 

• Content Metadata: 2 Megabytes per day 
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8. MKIS System 

8.1. Persistent Storage 

8.2. Hardware Interfaces 

8.2.1. Staging Server 

8.2.2. Cable Data 

8.2.3. IMS (indirect?) 

8.2.4. I3/Optimark Upload 

8.2.5. 3rd Party Analysis Upload 

8.2.6. Marketing Report Generation Ethernet Interface 

Content Merge Engine - 

8.3.1. Prevue Metadata 

8.3.2. Advertisement Metadata 

8.3.3. Advertising TrafHc Controller Metadata 

8.3.4. IMS Generated Metadata 
8.4. A nalysis Capabilities 

8.4.1. Rough Overview 
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9. Defining Terms 

The following are some terms used in this document and alternatives you may have encountered elsewhere: 

13: Formerly CONDO : Consortium of New Database Operators. A group of companies which will be providing an 
Analysis capability for Clickstream information, and will act as an interface between the advertising community and 
BellSouth. 

Clickstream System: A deraographics/programming ratings collection system which will function on the 
BellSouth IVSN. This document is an initial attempt to define the scope of this functionality, and the means we 
may use to make it happen. 

STB: Set-Top Box. The "Cable Converter" that sits on 'Top" (in this case possibly on bottom) of the 

subscribers television set. The device provides a platform on which content is converted to the NTSC video format 
and presented to the subscriber, and input is taken from the subscriber and transmitted to the cable operator. 
Analogous Terms: Set-Top Terminal (STT), Cable Converter, Home Communications Terminal (HCT), 
"Richmond"(Scientific- Atlanta model name for this device), 8600X, 86OOXD1. 

Subscriber The end user of the system, the person who subscribes to the service. Traditionally in consumer 
electronics the "consumer" is the end user of the products or systems produced. In Cable Television the "consumer** 
is typically a cable operator or large Multiple System Operator (MSO), so the term subscriber was adopted 

Upstream: Generic term used for a data path provided by any mechanism FROM the subscriber to the headend, 

server, or billing and management databases. If the system is viewed as billing and management databases at the 
top, video server and headend below, Level I system below that, and finally the subscriber and the STT at the 
bottom, then data flows "Upstream" or "Downstream". Analogous Terms: Reverse data path. 

TDMA, slotted ALOHA: The two mechanisms for upstream communication from the STT to the server . 

Downstream: Generic term used for a data path provided by any mechanism FROM billing or management 
computer to server or headend, and FROM server/headend to the subscriber/STB. Analogous Terms: Forward data 
path. 

Event: An action or change in a STT state deemed important to the building of a knowledge base on the 

subscriber. Examples include key presses to change channels, enter Navigator, turn STT off/on, etc. Also includes 
passive changes such a change of programming content while STT tunes single channel such commercial break, 
changes to different commercials, changes to next program, etc. 

Clickstream: Term used to describe a flow of STT event data upstream to the HP server complex. The 
Clickstream then flows further upstream to the 13 database where demographic and programming ratings intelligence 
can be gathered and formatted. 

ATOM: A content identifier coding scheme. ATOM codes will be used to determine programming ratings 

and subscriber viewing habits. 

PowerTV: The operating system used on this model of STB. It must provide for most, if not all of 

applications functionality through die use of APIs. 
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